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Abstract —We consider communication over the AWGN chan¬ 
nel with a transmitter whose battery is recharged with RF energy 
transfer at random times known to the receiver. We assume that 
the recharging process is i.i.d. Bernoulli. We characterize the ca¬ 
pacity of this channel as the limit of an n-letter maximum mutual 
information rate under both causal and noncausal transmitter 
knowledge of the battery recharges. With noncausal knowledge, it 
is possible to explicitly identify the maximizing input distribution, 
which we use to demonstrate that the capacity with noncausal 
knowledge of the battery recharges is strictly larger than that 
with causal knowledge. We then proceed to derive explicit upper 
and lower bounds on the capacity, which are within 1.05 bits/s/Hz 
of each other for all parameter values. 

I. Introduction 

There has been significant recent progress in building wire¬ 
less radios that possess no conventional batteries but are pow¬ 
ered with wireless energy transfer, with latest developments 
reporting smaller device sizes, better harvesting efficiencies 
and increased communication ranges and data rates m, El. 
Eor example, the ant-sized radios of III use the energy 
provided through the downlink channel in order to transmit 
over the uplink channel. We model communication with such 
externally powered transmitters by using a simple model. See 
Fig.m Here a transmitter equipped with a battery of size B 
is communicating to a receiver over the AWGN channel. The 
transmitter’s battery is recharged at random times; we assume 
that at each time t, the battery is recharged with probability 
p independent of previous time instants. We assume that the 
recharging times are known either causally or noncausally 
both at the transmitter and the receiver. While it is natural 
for the transmitter to be (at least causally) aware of its battery 
recharging times, the knowledge of the receiver is motivated 
by applications such as 12,12 where it is the receiver that 
powers the transmitter. 

The difficulty in characterizing the capacity of this setup lies 
in the fact that although the channel between the transmitter 
and the receiver is memoryless, the energy constraints on the 
transmitter lead to a random state for the system captured 
by the energy level of the battery. This state has memory 
and is input-dependent and even though the receiver can 
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Fig. 1. System model. 


track the battery recharging times, the energy state of the 
battery, and therefore the state of the system, is known causally 
only at the transmitter but not at the receiver. We show that 
this state-dependent system is nevertheless equivalent to a 
conceptually simpler memoryless channel which we call the 
clipping channel. The clipping channel admits real vectors 
as inputs, and outputs a clipped version of the input vector 
corrupted by white Gaussian noise. The clipping length is 
random and follows a geometric distribution. The clipping 
channel is a memoryless channel with i.i.d. states. Intuitively, 
each use of this channel corresponds to one epoch over the 
original channel with random battery recharges (RBR), where 
an epoch is the time period between two consecutive battery 
recharges. Using this equivalence, we provide an expression 
for the capacity of the RBR channel in Fig. [T] and find an 
explicit formula for the approximate capacity of this channel 
which we show is within 1.05 bits/channel use of the true 
capacity for all parameter values. 

This approximation is obtained by connecting the 
information-theoretic capacity of this channel to its long-term 
average throughput under optimal online power control. This 
latter communication-theoretic formulation of the problem 
assumes that there is an underlying transmission scheme which 
when allocated power Pt at time time t yields an instantaneous 
rate ^ log(l-l-Pt) and aims to maximize the long-term average 
throughput of the system by developing optimal online power 
control strategies Ei-inni. While only dynamic programming 
solutions are available for the online power control problem 
for general energy harvesting systems ini-m, we are able 
to explicitly characterize the optimal online power control 
strategy and the corresponding long-term average throughput 
for the specific model of interest here. This in turn yields an 
explicit formula for the approximate capacity of this system. 









In follow-up work, which we make simultaneously available 
on arXiv D2, we were able to generalize the approach of the 
current paper to understand the capacity of energy harvesting 
systems powered by general i.i.d. energy arrival processes. 
While the results of m apply to the current case, we believe 
the results we develop in this paper specifically for the RBR 
channel are still of interest for a number of reasons: 1) The 
results we provide for this specific setup are stronger than 
those that follow from ifTSl . For example, while both lITSl and 
the current paper characterize the capacity of the channel as an 
n-letter mutual information rate, the n-letter characterizations 
we obtain in this paper are much more explicit. This allows 
us, for example, to identify the maximizing input distribution 
for the n-letter expression when battery recharging times are 
known noncausally at the transmitter, which in turn allows 
us to show that noncausal knowledge of the battery recharges 
at the transmitter can strictly increase capacity over causal 
knowledge, even though the receiver also knows the i.i.d. 
battery recharging times. This result can be surprising given 
that for channels with i.i.d. states known both at the transmitter 
and the receiver, noncausal and causal knowledge of the 
states lead to the same capacity. Furthermore, we are able to 
explicitly solve the online power control problem for the RBR 
channel while ifTSl provides an approximately optimal online 
power control policy for the general case, which overall leads 
to an approximation of the capacity within 3.85 bits/channel 
use, while the approximation in the current paper is within 
1.05 bits/channel use. 2) The generalization approach in 1131 
reveals the current setup as a canonical example for energy 
harvesting communication which is simple but yet captures 
most of the challenges involved in the general scenario. 3) 
Due to increasing interest in applications with external battery 
recharges, we believe the current setup can be of interest in 
its own right. 

A. Relation to Prior Work 

The setup we consider in this paper corresponds to a special 
case of the energy-harvesting communication channel, the 
capacity of which, despite significant recent interest M- 
ED, remains an open problem. In particular, El considers a 
noiseless binary channel with a unit-sized battery where the 
battery recharges are known causally only at the transmitter. 
Our model resembles theirs in the fact that the energy ar¬ 
rival process is i.i.d. Bernoulli and each energy arrival fully 
recharges the battery. However, it is more general in the fact 
that we consider noisy channels and battery size and the the 
input alphabet are not matched to each other. With a binary 
channel and unit battery, information can be only encoded in 
the timing of the unit-energy pulse which makes the setup of 
El equivalent to a timing channel. In our current case with an 
arbitrary battery size and continuous inputs, information can be 
encoded through real valued codewords and achieving capacity 
requires to also devise an optimal power control strategy. 
The most closely related reference to our work is im which 
considers an i.i.d. Bernoulli energy harvesting process where 
an energy packet of size E is harvested with probability p 


at each channel use and the transmitter is equipped with a 
battery of size B which can be either smaller or larger than 
E. (Reference ll20ll considers a special case of this model 
with p = 1 in which case the harvesting process becomes 
deterministic). Our model corresponds to a special case of 
the model in lfT9l with B < E. llT9ll provides upper and 
lower bounds on the capacity of this channel which are within 
2.58 bits/s/Hz without providing an explicit expression for 
the capacity. To be more precise, 2.58 bits/s/Hz is the gap to 
capacity when the receiver has no information of the energy 
harvesting process. When the receiver has side information as 
we assume here, the gap can be readily decreased by H{Et), 
the entropy rate of the energy harvesting process, which is 
at most 1 bit/s/Hz for Bernoulli arrivals. The contributions of 
the current paper with respect to El are: 1) we provide an 
explicit formula for the capacity by establishing an equivalence 
to the clipping channel; 2) we derive novel upper and lower 
bounds to the capacity in terms of a power control problem 
for which we provide an explicit solution; this decreases the 
capacity approximation gap to 1.05 bits/s/Hz; 3) we show that 
the capacity with noncausal knowledge of the energy arrivals at 
the transmitter is strictly larger than the corresponding causal 
capacity. 

Section In] describes our model for the channel with random 
battery recharges and Section [HI] contains the main results 
of the paper and the definition for the clipping channel. 
Section |IV] provides the proof of our main theorem, namely 
the equivalence to the clipping channel. In Section |V] we show 
that noncausal observations of the energy arrivals can strictly 
increase capacity, and in Section |Vl] we provide a derivation 
of capacity bounds. 

II. System Model 

We consider a transmitter powered by RF energy transfer 
which communicates to a receiver over an AWGN channel, 
i.e. the output at time t is Yt = Xt + Zt, where 2ft S K. is 
the input to the channel and Zt ^ A/^(0,1) is the noise. We 
assume that the transmitter has a battery with finite capacity B 
which is recharged with probability p at each channel use, i.e. 
the energy arrivals Et are i.i.d. Bernoulli RVs: 

B w.p. p 
0 w.p. 1 — p. 

The effort to shrink down the size of wireless sensors and 
actuators limits the amount of energy that can be harvested 
at any given time, as well as the capacity of the storage unit 
that can be accommodated by the device. This necessitates 
the recharging process to operate at a scale comparable to the 
symbol duration H]. The randomness in the energy transfer 
process can be due to fluctuations in the alignment of antennas 
and the position of nodes, as well as randomness in the energy 
transfer times. We assume that the recharging times are known 
causally to both the transmitter and the receiver. The knowl¬ 
edge of the energy arrivals at the receiver is motivated by the 
fact that often it is the receiver that powers the transmitter, and 
the transmitter can acknowledge its battery exceeding a certain 


threshold by sending a short pulse to simplify operation. We 
also consider the case of noncausal energy arrival information 
at the transmitter, mainly for comparison with the causal case. 

Under this model, energy of the channel input symbol at 
each time slot is limited by the available energy in the battery. 
Let Bt represent the available energy in the battery at time t. 
The system energy constraints can be described as 

\Xtf<Bt, (1) 

+ ( 2 ) 

This implies that at time t, either Bt = B w.p. p, or 
Bt — Bt-i — |Xt_ip w.p. 1 — p. We assume without loss 
of generality that Bq = B, which implies that we can also 
assume Ei = B w.p. 1. 

An (M, n) code for the random battery recharges (RBR) 
channel is a set of encoding functions ft and a decoding 
function g: 

ft:Mx£^^X, t = (3) 

g ■.y'^ X ^ M, (4) 

where £ = {Q,B}, X = y = M. wA M = {1,... ,M}. To 

transmit message w G Ai at time t = 1,... ,n, the transmitter 

sets Xt = ft{w,E*). The battery state Bt is a deterministic 
function of {X*~^,E*), therefore also of {w,E*). The func¬ 
tions ft must satisfy the energy constraint ([T]|; \ft iw,E*r< 
Bt{w,E*). The receiver sets W = 5 (U”, E^). The probability 
of error is 

1 “ 

pRBR _ _ ^ Pr(fU w \ w was transmitted). 

W — 1 

The rate of an {M,n) code is i? = A rate R is 

achievable if for every e > 0 there exists a sequence of (M, n) 
codes that satisfy > R — e and ^ 0 as n —^ cxj. 

The capacity Crbr is the supremum of all achievable rates. 

When noncausal energy arrival information is available at 
the transmitter, the symbol transmitted at time t can depend 
on the entire realization of the energy arrival process i?". In 
this case, equation Q becomes 

ft:Mx£^^X, t=l,...,n, 

with remaining definitions unchanged. 

III. Main Results 

The RBR channel described above has a random state Bt 
which depends on the input and the exogenous energy arrival 
process. An explicit expression for the capacity is so far only 
available in terms of the Verdu-Han framework IfTSl . However, 
we will see that this channel is conceptually equivalent to 
a clipping channel. In the sequel, we define a sequence of 
clipping channels parametrized by the length N of the input 
vector. For each N, the channel is memoryless and time- 
invariant, and lends to an almost trivial analysis of capacity. 

Definition 1 (Clipping Channel). The {N)-clippmg channel 
is a memoryless channel which at time i admits inputs 


= {X\^\...,X[^'^) e and outputs 

,. ■., Yi^’) G y^^^ = R^. The inputs must 
satisfy the energy constraint 

j=t 

Each use of the channel is associated with a state variable Li, 
called the clipping length. Li are i.i.d. RVs, independent of the 
input, and follow a geometric distribution with parameter p: 

Pi^Li = k) = {I - p)'"~^p, fc = l,2,... 

The states Li are known at the receiver but not at the 
transmitter. At channel use i, if Li < N, the channel output 
is given by 

^ IE ^ ^ Bi 

\0 ,j>L, 

where Zij ~ A/'(0,1) are i.i.d. for different i,j, independent 
of xj'^^ and Li. If Li > N, the channel outputs = 0. 

An {M,n) code for the clipping channel consists of encod¬ 
ing and decoding functions 

fl^^:M^x(^K i = l,...,n, (6) 

giN) . (3)(N))n X ^ (7) 

where £ = N, and AA = M}. The transmitted 

codeword is x[^'^ = fl^\w), i = 1,... ,n, and the encoding 
functions must satisfy < B. The decoded message 

is W = g(^)((y(^))", L"). The rate of the code is R = 
and the capacity is defined in the standard way as the 

supremum of all achievable rates. 

Intuitively, at each channel use the channel chooses an i.i.d. 
clipping length Li and outputs only the first Li components 
of the input vector under additive white Gaussian noise (The 
case Li > N and the corresponding behaviour of the channel 
are rather technicalities; in the sequel we will be interested in 
oo in which case the length of the input vector goes 
to infinity and the probability of Li > N goes to zero). The 
(A^)-clipping channel is a standard vector memoryless channel 
with i.i.d. state information Li available at the receiver. The 
capacity of this channel is well known and is given by 

max 

P pix^y, 

= max 

We can rewrite this expression in the following explicit form 

N 

N 

= max + (8) 

pi^ )■ i._i 

||x"|p<s 





where in the last line = {Xi,...,Xpf) (and X^ = 
(Xi,...,Xfc)), and = iZi,... , Zjv) is a vector with i.i.d. 
Gaussian entries, i.e., Zi ^ Af{0, 1) i.i.d for i = 1,..., N. 

It is easy to see that is monotonically increasing in N 
and is also bounded above (see also Proposition |2]i, therefore 
it has a limit, which we call Cdpi 

Cdp = Im = sup (9) 

Intuitively, taking N ^ oo gives a channel with infinitely 
long input and output, thus simulating an epoch in the original 
channel, i.e. the time interval between two successive battery 
recharges. Indeed, we show that Cdp is in fact, up to a constant 
factor, the capacity of the RBR channel Crbr. We bring the 
following theorem without proof; 

Theorem 1 (Channel Equivalence). 

Crbr = p ■ Cclp. 

Proof: See Section HVl ■ 

Although the two channels are clearly related, with each 
use of the clipping channel corresponding to one epoch over 
the RBR channel, the fact that they are equivalent may be a 
priori unclear. Indeed, these two channels have quite different 
characteristics; the first has an input-dependent state with 
memory which is causally known at the transmitter; the second 
is a simple memoryless channel with states unknown at the 
transmitter. The intuitive connection is that whenever there is 
a battery recharge in the first channel, the system resets, and 
any memory of the channel (which is embedded in the state 
of the battery) is erased. However, even with this intuition, it 
may be unclear why one could not benefit from transmitting 
the symbols in each epoch in a sequential manner (having 
as side information the time since the last battery recharge) 
as compared to one-shot transmission of the “epoch symbol” 
with no side information. The proof of the theorem formally 
argues that codes designed for one channel can be used over 
the other channel with similar performance. 

Using Theorem [T] and ([8]l, we obtain the capacity of the 
AWGN channel with random battery recharges; 

Corollary 1 (Capacity of the RBR Channel). The capacity of 
the channel defined in Section is given by 

N 

Crbr= lim max p^il - pf-^IiX'^■ X^ + Z'^). 

tV-S-OO ^ 

( 10 ) 

It is easy to extend this result to the case of noncausal 
observations of the energy arrival process. In this case, it can 
be shown that the channel is equivalent to a clipping channel 
with state Li available at both the transmitter and the receiver. 
The capacity of such a channel is also standard and is obtained 
by optimizing over all input distributions conditioned on the 
state; 

CcSoncausal = ^ax f (^) | L). 

||X(N)||2<5 


Using the equivalence and writing the above expression explic¬ 
itly (in a form analogous to ®) we get the following result; 

Theorem 2 (Noncausal Capacity). The capacity of the channel 
defined in Section\n\with energy arrival information available 
noncausally at the transmitter and the receiver is given by 

OO 

CRBR,noncausal = V/(l - p)"”' max + Z'^). 

\\X^\\'^<B 

( 11 ) 

Proof: See Appendix lAl ■ 

It is possible to explicitly identify the maximizing input 
distribution in the above expression by using the results 
of ll22l - ll24ll . which characterize the capacity of amplitude- 
constrained channels. In particular, ll24l shows that the max¬ 
imizing X^ in (dUi is distributed over a finite set of k- 
dimensional spheres with uniform phase, where the number 
of spheres is determined by the value of B (ex. when B is 
very small, X^ is uniformly distributed over a single sphere 
of radius '/B). Using this result, we suggest the following 
proposition. 

Proposition 1. Noncausal observations of the energy arrival 
process strictly increase capacity. That is, 

CRBR.noncausal ^ Crbr- 

Proof: See Section IV] ■ 

This result may be surprising given that for a memoryless 
channel with i.i.d. state S, the capacity with side information 
at both the transmitter and the receiver is given by I{X-,Y\S), 
whether the side information is available causally or non¬ 
causally. The difference here is that even though the battery 
recharges Et are i.i.d. and known to both the transmitter and 
the receiver, the state of the system is captured by Bt rather 
than Et, which has memory and is unknown to the receiver due 
to its input-dependence. The fact that noncausal knowledge 
of the energy arrivals strictly increases capacity can be also 
observed by using the upper and lower bounds on the causal 
and noncausal capacities developed below. 

Despite being relatively simpler than previous result^ (fTOl i 
and (fTTTi are difficult to compute explicitly. In particular, (fTOl i 
is a multi-letter expression that involves optimization over an 
infinite dimensional space. Therefore, we wish to find suitable 
approximations. More specifically, we provide an upper and 
a lower bound, separated by a constant gap of approximately 
1.05 bits; 

‘ In (25), we characterize the capacity of the general energy harvesting 
channel in the form 

c= lim - sup I(U^ ■,¥"), 

n —>oo Yi 

where the domain of the optimization problem is suitably defined. Note that 
the capacity expressions in Go) and CD are much more explicit, and in 
particular, it is this explicit form that allows us to identify the maximizing 
input distribution in CD. 


Bounds for p = 0.1 


Proposition 2 (Capacity Bounds). The capacity of the RBR 
channel is bounded by: 

C-ilog(y) (12) 

where 

^ - 1 

C= lim max Vp(l-p)* i-log(l + fO- (13) 
Si>t) 

Proof: See Section IVTl ■ 

It can be shown that the upper bound C in (fOl) corresponds 
to the online power control problem, extensively studied in 
the literature in the general framework of energy-harvesting 
channels |[3l-||5l. Here, one assumes that there is an underlying 
transmission scheme operating at a finer time-scale, such that 
allocating power P to this scheme yields an information rate 
r{P) = i log(l -h P), and focuses on the optimal power 
allocation policy satisfying the energy constraints on the 
transmitter. For the specific channel of interest here, this online 
power control problem can be explicitly solved. In particular, 
we apply the KKT conditions to the optimization problem 
in (foi l, to obtain the optimal values of £i (see Appendix O: 


£^ = 




N 


(14) 


where N is the smallest positive integer satisfying 
1 > il-pf[l+piB + N)]. 

This gives the following expression for C: 

l-p-{l-p)^{l-p + Np) 

+ -^-log 1 - p)- 

2p 

Combined with (fT2l i. this is the capacity of the RBR channel 
within 1.05 bits/channel use. 

It was shown in m that the capacity of an energy har¬ 
vesting channel with infinite battery size is ^ log(l -I- EfF^t]). 
Clearly, this is an upper bound to the capacity of our channel, 
and this can be readily obtained from the result of Proposi¬ 
tion |2] Using concavity of the log function in (fTJl l: 

C < lim max i log | 1 -h p{l — | 

{£i}f=i- 2 I ^ ) 

= ^log(l +pS), 

where the last step follows because the optimal values for the 
first line are £i = B and £i = Q for i > 2. lfT9l used this 
upper bound corresponding to infinite battery size to bound 
the capacity of the energy harvesting channel with Bernoulli 
energy arrivals. Fig. |2] illustrates that the upper bound we 



Fig. 2. Upper and lower bounds for p = 0.1. The shaded region indicates 
where the capacity of the energy harvesting channel can lie. 


provide here is strictly smaller than the infinite battery upper 
bound. Similarly, our lower bound here is based on the optimal 
power allocation strategy we characterize in (fT4l i. while the 
lower bound in lfT9ll is based on a suboptimal power allocation 
policy. 

Similar bounds can be obtained for (fTTI) . which we state in 
the following proposition. 


Proposition 3. The capacity of the RBR channel with non- 
causal energy arrival information is bounded by: 


Cnoncausal 




— CRBR.noncausal ^ Cnoncausah ( 15 ) 


where 


cx; , 

Cnoncausal = ^ 7?^ (1 - - log(l + B /k) . (16) 

k^l 

Proof: See Appendix iBl ■ 

In Appendix |B] we consider another lower bound for the 
capacity with noncausal energy arrival information at the 
transmitter, which is tighter than the one in (fTSl) . and plot 
it in Figure |3] together with the upper bound in (fT^ on the 
capacity with causal energy arrival information. It is clear from 
the graph that for some values of B, the noncausal capacity 
is strictly greater than the causal capacity, further illustrating 
the observation we state in Proposition [T] 


IV. Capacity of the RBR Channel; Prooe of 
Theorem[T] 

Recall Definition [T] of the (V)-clipping channel. This is 
a memoryless time-invariant channel with i.i.d. states known 
to the receiver. Now, assume there was a feedback link 
from the receiver to the transmitter that could feed back the 
channel state information to the transmitter in a strictly causal 
fashion. Since feedback cannot increase the capacity of a 
memoryless channel, the capacity of this new clipping channel 
with the state information Li available strictly causally at the 
transmitter is the same as that of our original clipping channel 
in Definition [T] We will consider this new clipping channel 
with strictly causal state information in the sequel and modify 












Bounds for p = 0.01 



Fig. 3. Noncausal lower bound and causal upper bound for p = 0.01. 
Noncausal capacity is strictly greater than causal capacity for some values of 

B. 


the definition of the encoding functions in (|6]l to depend on 
the past Li'. 

* = (17) 

such that the transmitted symbol at time i is = 

With this modification, we show that the 
clipping channel is equivalent to the RBR channel by showing 
first that p ■ Cdp < Crbr and then p ■ Cdp > Crbr. 


A. p ■ Ccip < Crbr 

We will show that if rate R is achievable for the (iV)- 
clipping channel for some N, then rate pR is achievable over 
the RBR channel. Fix e > 0. Let R < be an achievable 
rate for the (V)-clipping channel. Then there exists uq such 
that for every n > uq there exists an (M, n) code with 
probability of error < pe/2 and > R — e. 

Fix such n > no. Consider the following {M,n') code for 
the RBR channel. At time t, 1 < t < n', the transmitter has 
causal knowledge of the energy arrivals E*. Let Kt denote 
the number of energy arrivals up to time t, and denote by 
the energy arrival times, i.e. the times for which 
Et = B. Note that ATj G {1,..., f} is a random quantity and 
is a function of E*. Since we assume Ei = B (cf. Section HU), 
we always have Tq = 1. For each i = 1,..., Kt — 1, define 
Li = Ti — Ti-i. As n' —?> oo, these Li’s will behave as i.i.d. 
geometric RVs with parameter p. In order to communicate 
message w, upon observing the transmitter sends the 
following symbol at time t: 




Xt{w,E^) = 


t — Txt-i + I < N 


t — Tjit-i + 1 > W 


where X^^\w, L’' is the j-th element of the i-th symbol 
in the (W)-clipping channel code defined in (fTTI) . This 

codeword will satisfy the energy constraint O, since for 


each t. 


t-TKf-l 


Bt = B- 


\x_ 


Kt,j\ 


> I 1^ 

^ \^Kt,t-TKt-i + l\ ’ 


i=i 


because 1^^^ '' 




< B: a 


clipping channel code must satisfy the energy constraint (|5]). 

At the end of time slot n', the receiver observes V” 
and E’^', and forms and Define also 

Lk , = n' + 1 — Tk ,-!■ Define for f = 1,..., Ar„/: 



if Li < N, and = 0 if Li > N. Similarly, set = 0 
for i = Kn' + 1 ,... ,n (we will see shortly that the probability 
that Kn' < n should vanish). The receiver then sets W = 
L"), where W = the decoding function of 
the clipping channel code defined in O. 

To analyze the probability of error, we separate into two 
cases: Kn' < n and Kn' > n. When n' is chosen appropriately 
as a function of n, Kn' < n should occur with very small 
probability, and otherwise when Kn' > n, intuitively we 
have been able to transmit all symbols of the (V)-clipping 
channel codeword of length n and the channel we induce from 
to is similar to the (V)-clipping channel, 

therefore the decoder should give vanishing probability of 
error. Formally, 

pRBR ^ ^ pi Kn'>n) 

+ Pr(IF n Kn' <n) 

< Ft{W i-W n Kn'>n)X Pr(A:„, < n). (18) 

We start with the first term. Consider an (M, n) 
(V)-clipping channel code. For a given message w and 
clipping length realizations the input (j(JV))n 

is uniquely 

determined and therefore also the transition probability 
Z”). Denote P^^\£\w,C) as the proba¬ 
bility of error. The total probability of error for the code, over 
the (V)-clipping channel, is then 


1 


M 




w—1 


h-l 


We 


where P{E) = HEi Pi^) P{li) = p{l - p) 
choose n large enough so that < pe/2. 

Now going back to our RBR channel, consider a realization 
of the energy arrival process e" , and denote PRBR(f|w,e" ) 
as the probability of error given this realization and a message 
w, using the scheme described above. Then we can write the 
first term in (HI as: 

Pr(l^ n Kn'>n) 

1 ^ 

= XfE E ^(e”)^RBR(fk,eE, 


:k.„r>r, 






where kn’ denotes the number of energy arrivals in e" , and 
P(e" ) = — p)" (recall that ei is fixed), and 

PRBR(iS|rc, e" ) is the probability of error for the RBR channel 
using the scheme described above. 

For a sequence e" with at least n energy arrivals, we 
can define L"(e" ) = Z" as the sequence of lengths of the 
first n epochs, i.e. the times between consecutive energy 
arrivals. If there are exactly n energy arrivals, so that the last 
epoch is undefined, we define it to be In = n' — 
i.e. it lasts until then end of the sequence e" . Then we 
have PRBR(f |w, e" ) = P"(e"')), and denoting 

^ h, we have 

P(e"') = P(e^)P(e^;i) = ip(r)P(e4i). 

If kn' = n, then it is understood that 
-P(e^_i_i) = Pien'^i) = 1- We can write the above sum 
as 

^ P{e^')PRBR{£\w,e^') 

'.k^i'>n 

= E E ^(e”>RBR(fk,e"') 

L"(e"')=Z'‘ 

= E \p{np^^\£\w,n E 

Q-n' 

\<n' ^+1 

<^ip(Z")pW(fKP), 

i" F 

which gives Pr(lf^ ^ W D Kn' > n) < < s/2. 

The second term in (fTsT i can be bounded using the law 
of large numbers. Since Kn' = 1 + ^ Zi with Zi ^ 

Bernoulli (p). 


Pr(Pr„/ < n) = Pr 



n' — 1 


Y,z.< 




for n' large enough if ///P}^ < p, say //r\ = p{^ — s). n' can 
be chosen large enough so that also n > uq. The rate of this 
code is 


logM 

n' 


> P(1 


e) 


logM 

n 


>p(i 


e)(P — s) = pR — e', 


where s' = p£(l + P — e). We showed that for n' large 
enough we can obtain P^^^ < s with rate close to pR, 
i.e. pR is achievable for the RBR channel, which implies 
P ■ — C'rbr- Since we showed this for arbitrary N, we 

conclude that p ■ Cdp = p • sup^>i < Crbr. ■ 


B. Crbr < p ■ Ccip 

We show that any achievable rate for the RBR channel is 
e-achievable on the (7V)-clipping channel for some N large 
enough. Fix e > 0. Let R < Crbr be an achievable rate for the 
RBR channel. Then there exists rig such that for all integers 
n' > Tig there exists an {M,n') code with p^^R ^ ^^2 and 
^ >R-s. 

Fix n' > ti'q and N. We suggest the following (M, n) 
coding scheme for the (iV)-clipping channel; consider time 
i, 1 < i < n. For a given sequence of past clipping lengths 
denote the arrival times P, = 1 + X]'fc=i Pk for j = 
0,1,..., i — 1. Define the sequence , E^}) 

as follows (this sequence is different for different Fs): 

^(i) _ f-B 5 f = Po) 7l, r2, • ■ • ) Pi-1 

^ 10 ) Otherwise 


Note that E^'^ = 0 for all ^ > Pi_i. It is possible for some 
values of Tj to be larger than n' - these values are simply 
ignored. At time i, the transmitter sends the codeword 
whose j-th component, 1 < j < W, is defined as 




,Pi-i + i - 1 < n' 

0 

,Pi-i + j -l>n' 


where Xt = ft, the encoding functions of the RBR channel 
code, defined in Q. Since = B and E^f* = 0 for 

^ > Pi_i, by the definition of the RBR channel energy con¬ 
straints ([T]i and (|2]), it is evident that \\xl^^\w, < B. 

The receiver observes and Li, and forms a concatena¬ 
tion of the non-clipped outputs. More precisely, if Li < N, set 
pTi-i-i-i-i = J = 1) • • •) -Bi- ff Pi > W, the receiver 

declares an error for the entire transmission. At the end of time 
n, the receiver decides on message W = Wi/Y'" ), 

where W = g, the decoding function of the RBR channel 
code, defined in (|4]l. 

The probability of error analysis here follows the same lines 
of Section HV-AI Denote the event 

n 

Po = {Tn < n'} U |J{P, > N}. (20) 

i=l 

We separate between the case Eq, for which we can show that 
the decoder gives vanishing probability of error, and the case 
Eq, the probability of which can be upper bounded by s/2. 

We can bound the probability of error for the (A^)-clipping 
channel as follows; 


pcip.(iv) ^ ^wr]E/^) + Pt{w ^wnEo) 

<Pr(VF^fL n Pg)-fPr(Pg). ( 21 ) 

To bound the first term, consider an {M,n') RBR channel 
code. For a given message w and energy arrival process 
realization e" , the input a;" is uniquely determined and 
therefore also the transition probability p(t/" |a;" , e" ). Denote 











-Frbr(iS|w, e" ) as the probability of error, so that the total 
probability of error for the RBR channel is 

1 “ 

= M E E )P^BK{£\w,e- ), 

w=l e"' 

/ 

where P(e" ) is the probability of the energy arrival process, 
as described in Section IIV-AI Choose n' large enough so that 

< e/2. 

The first term in (ISTT i can be written as 

M 

Pr(ic ^WnE^o) = j^Yl E 

w = l 

127=1 h>n' 

li<N Vi 

where P{E) is the probability of n independent clipping 
lengths as in Section HV-AI and P) is the probabil¬ 

ity of error for the clipping channel using the scheme described 
above. 

For any given E, there is a corresponding sequence of 
length A = X]r=i ^ B}^, as defined by (fT^ . For E 

such that X]r=i k < N, i = 1,... ,n, denote by 

P" (E) the first n' elements of the corresponding e^. Then 

p^^\s\w,n = p^ji^{£\w,E^'{n). 

Note also that P" (Z”) can have at most n non-zero elements 
(i.e. energy arrivals). 

For a fixed w, we write 

E pr)pS\^\^^n 

Er=i U>n' 
li<N Vi 

= E E p{npRBR{£\w,e-') 

e^': _ n / 

at most n B's Ei=i 

li<N Vi 

= E PRBR{£\w,e^') E 

at most n B’s IIir=l h>fJ. 

li<N Vi 

E’'’{l")=e"-' 

< E ^RBR(fk,e"') E (22) 

e"': _ n , 

at most n B’s 

Now, observe that a sequence e" with exactly i energy arrivals 
(with ei being the first energy arrival by assumption) can be 
described by z — 1 epoch lengths where Ij is the 

time between the j-th and (j -f l)-th energy arrivals. The last 
epoch length is always li = n' — h ■ Then, a sequence 

Z" satisfies E^' (E) = e"' if and only if 

~ J “ 1 ; • ■ • ) * 1 ) 

^i 22 ^i- 


For example, say e® = 100010100. Then any sequence Z" for 
which Zi = 4, Z 2 = 2, and Z 3 > 3 will satisfy i?®(Z") = e®. 
Fix e" with i energy arrivals. Then, since ei = B w.p. 1, 

P(e"') =p*-i(l-p)"= lf[p{l-p)h-^ = -f[P{l,), 

^ 1=1 ^i=i 

where P{lj) is the probability of a geometric RV with param¬ 
eter p. We can write the sum in (l22l i as 

E pin= E 

i": 

E"=i4>n' 

_E"'(r)=e"' 

i-1 

i=i 

= P(e'^'), 


where the last step is because for a geometric RV, 

00 

Pr(P > £) = ^p(l - = (1 - = - Pr(P = i). 


k=l 


P 


Plugging this back into the original expression for the error 
probability: 

Pr(lF^lV n Pg) 

1 ^ , 
^IjE E ^^RBR(fKe")P(e") 


W—1 


at most n B's 


^/^EE^>^BR(£k,e"')P(e"') 


M 


.RBR 


M 

= P' 

< e/2. 

The second term in (ISTT i can be upper bounded by applying 
the union bound and the law of large numbers: 


Pr(-Eo) < Pr(rn - 1 < n') P ^ Pr(L^ > N) 

see 

“ 4 4 ~ 2 ’ 


(23) 


for n large enough if n'/n < EL = say — = and 
N > We get that there exists N 

for which < g for n sufficiently large. The rate of 

this (M, n) code is 

logM _ l-elogM .l-e A _ ^ / 

— \it Sj e , 

n p n p p 

where e' = s ■ Next, we have from Fano’s inequality: 

R , log M 
- e < - 


n 













which implies 


< iY^^'>r\L^) + h 2 {e)] 


1 

< - 

1 — e 


MN) 

'-^clp + 


h2{£) 

n 


1 

< - 

1 — e 


C'clp + 


h2{£) 

n 


Taking e —0, and observing that this applies to any 
R < Crbr, gives Crbr/p < Cdp. ■ 


V. Noncausal Side Information Strictly 
Increases Capacity: Proof of Proposition [T] 

We would like to show that Crbr < Crbr, noncausal, where 
Crbr and CRBR.noncausai are given by ([T0| and ([TT]l respectively. 

Let gnix^) and fk{x^) be the maximizing distributions 
in ([Tol l and (fTTT i respectively, that is: 

p„(x")= argmax i V (1 - I , 

p(x"): ||X"P<B J 

(24) 

fk{x'^)= argmax /(X'=;y'=). (25) 

p(rc'“): ||X''||2<B 

The maximizing distributions fk{x^) are unique and are found 
explicitly in ll22l - ll24l . For any 1 < I < /c, let fkix^) denote 
the marginal distribution of fk{x^), that is 

fk(x^) = J fk{x'')dx^+i, 

and similarly for gn{x^). Denote /(p(X^)) as the mutual 
information I{X^; + computed with input distribution 

p{x^). 

Using this notation, we rewrite (doll and ([IT]i as follows: 

n 

Crbr= lim Vp 2 (i_p)'=-ij( 5 „(x'=)), 

n—¥oo 

n 

Crbr, noncausal = 1™ V p'(1 - '/(/fe (X'')). 

n—^co 

Obviously Crbr < CRBR.noncausai- Suppose that also Crbr = 
Crbr, noncausal- This will imply 

n 

hm Y.{l-pf-^[I{fk{X>^)) - I{gn{X^))\ = 0 . 
k^\ 

Since each term in the sum is non-negative, we get in particular 
that the sum of the first two elements must vanish, or 


lim {J(p„(Xi)) + (1 -p)/(p„(X2))} 

n—¥oo 

= /(/i(Xi)) + ( 1 -p)/(/ 2 (X 2 )). 

Next, consider g 2 {x^) as dehned in (l24ll . Since ||X’^|p < B 
implies ||X^|p < B, we get for every n > 2: 

/(p„(Xi)) + (l-p)/(p„(X 2 )) 

< max {/(Xi;yi) + (l-p)J(X2;r2)} 

= /(P2(Xi)) + (1-p)J(p2(X2)), 


/(/i(Xi)) + ( 1 -p)J(/ 2 (X 2 )) 

</(S 2 (Xi)) + ( 1 -p)/(p 2 (X 2 )). (26) 

Next, since ||X^|p < B implies |Xip < B, we get 
that /(p 2 (Xi)) < I{fi{Xi)). Substituting in (l26T l. we get 
/(/ 2 (X^)) < I{g 2 {X‘^)). Since f 2 ix'^) is the unique maxi¬ 
mizer of I{X‘^;Y^), this implies f 2 ix^) = g 2 {x‘^)- Substitut¬ 
ing this back in (|2^ . we get I{fi{Xi)) < I{g 2 {Xi)). Again, 
from uniqueness, this implies fi{x) = g 2 {x). Together, we 
see that 

fl{xi) = g 2 {xi) = j g 2 {xi,X 2 )dX 2 = J f 2 {xi,X 2 )dX 2 , 

which is a contradiction, since fi{xi) is discrete ll 22 l . whereas 
f 2 ixi,X 2 ) has discrete amplitude and uniform phase ll^ . 
Therefore we must have Crbr < CRBR.noncausai- 

VI. Capacity Bounds: Proof of Proposition [2] 

In this section we will develop the upper and lower bounds 
to Crbr, as given in Proposition |2] 


A. Upper Bound 

We can relax the energy constraint in (fTOl i to be only in 
expectation, thus giving an upper bound: 


N 


Crbr< lim max p^{I - pf-^I{X^] X^ + Z^) 

x->oo p(x^y. 


- k=l 


E||X" |F<B 

N k 

< lim max p^{l — I{Xi] Xi + Zi) 

E||X^|p<B 
= lim max 

N^oo p{x^): 


i=l 


N 


^p(i - py-^ [1 - (1 - p)^-^+yi{xp, X, + zy 


2=1 


N 


< lim max y^p(l-p)® ^I{Xi;Xi + Zi) 
E\\X^f<B 


N 


= lim max ^p(l - p)* log(l-f fi), 

£i>0 ,i=l.....A 


where the last equality is obtained by choosing Xi ~ V(0, Ei) 
independent of each other. This gives the RHS in (fTZt . ■ 


B. Lower Bound 

To lower bound (fTOb . we can choose a suboptimal dis¬ 
tribution for which the X^’s are independent, i.e. p{x^) = 
n^iP(a:i), and each of them satishes |Xip < Ei a.s. for 
some Ei > 0. To satisfy the total energy constraint we must 
have — R- Under this input distribution, we have for 










every i\ I[Xi; Yi) < ^ log(l Si) < ^ log(l + B), and thus 
for every N: 

N 

+ Z^) 

k^l 

N 

= J2p{i- py-^[i - (1 - pf-^+yi{Xf,X, + Z,) 

2^1 

N 

>J2p{^-pY~^Hx^;X, + z,) 

2^1 

-N{l-p)^^log{l + B). 

Taking N oo, the second term vanishes, and we are left 
with the following lower bound; 

N 

C'rbr > lim max p{l - py~^ I{Xi; Xi + Zi). 

Si>0 


Since p{xi) was arbitrary, we can choose it to maximize 
I{Xi]Xi + Zi). We obtain 

N 


C'rbr > 


lim max 

N-^aD 

Si>0 

T,y=i £i<B 


y]p(l-p)*-iCs™th(C*), (27) 
2 = 1 


where 

Csmith(C)= max /(X;^) ( 28 ) 

p{x): X^<S 

is the capacity of the amplitude constrained scalar AWGN 
channel studied in ll22l . where the optimal value for this 
mutual information maximization problem is found. Unfor¬ 
tunately, it is not tractable. Hence, as done in lfT9l , Il26l , we 
lower bound it as follows: 

Cs„i.(£)>fiog(i + |)-5iog(y) 

> i log(l + C) - ^ log ( y) • (29) 

Plugging this into dZTl l gives the LHS of (fT^ . 
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Appendix A 

Noncausal Capacity; Proof of Theorem[2] 

With noncausal energy arrival information, the RBR chan¬ 
nel model remains the same except for equation Q, which 


becomes 




The (A^)-clipping channel now has causal knowledge of the 
clipping lengths at the transmitter. It is a well known fact that 
with i.i.d. side information available at both the transmitter and 
the receiver, the capacity is the same whether it is available 
causally or noncausally. Therefore we can repeat the steps of 
Section HVl while altering the definition of the (A^)-clipping 
channel by replacing (fTTl) with 

concluding that 

CRBR.noncausal — P * I^clp.noncausal ■ (30) 


The capacity of a memoryless channel with i.i.d. side 
information available at both the transmitter and the receiver 
is given by (see e.g. Il27l Chapter 7]) 

CcSoncausal = ^ax I \L). 

p(£W|i): 

Then, following the same steps leading to (l8]l in Section HIH 
we get 


C. 


(AT) 


clp,noncausal 

CXD 

= y"p(l-p)'="^ max = fc) 

||X(-?^)|| 2 <^ 


N 


= yp(l-p)'=-i max + Z^). 




Finally, we have 


C*RBR,noncausal P ^clp.noncausal 


N—¥oo 


= yp2(lmax I{X^;X'^ + Z'^). 


k=l 


which is the expression in (fTTl) . 


Appendix B 

Noncausal Capacity Bounds: Proof of 
Proposition!!] 

We prove Proposition |3] following the same steps as in the 
previous sections. For the upper bound, we can similarly relax 
the energy constraint in (fTTIi to be only in expectation, thus 
giving an upper bound: 

CXD 

Crbr, noncausal < V p"(l " max /(X''; X’^ + Z'^) 

¥.\\X'‘\f<B 

CXD , 

= Y^p\l-pf-^-\og{l + B/k), 

fc=l 


where the last line is a known result for vector Gaussian 
channels. 

We derive a lower bound for (ITTI i b^ considering a subopti- 
mal input distribution. Let p(x^) = 0^=1 Pkixi), where pk{x) 
is some distribution for which < B/k a.s. We then have 
the following lower bound: 

CXD 

C'rbR, noncausal > ^^P^ - p)^~^ / k), (31) 

k^l 

where CsmithfiS) is defined in (1281 ). The expression has been 
evaluated numerically using the algorithm suggested in ll22i 
and plotted in Figure |3] 

Next, using (1291 ), we can further lower bound (I 3 TI 1 to 
obtain (fTsT l. 


Appendix C 

Optimal Online Power Control 

We solve the maximization problem in (ITsT l. Writing the 
problem in standard form and using KKT conditions, we have 
for i = 1,..., X: 


with Ai,A > 0 and the complementary slackness conditions: 
\£, = 0 and £^-B)= 0. 

To obtain the non-zero values of £i, we set Ai = 0: 


c. p(l-py . 

* 2A 


(32) 


Since £i > 0, this implies A < p^^~p^ — for all i for which 
£i > 0. This is a decreasing function of i, therefore there exists 
an integer N such that £i>Q for i = 1,... , X and £i = Q 
for i > N. Since we are solving this problem for X 00 , it 
is safe to assume X < X. 

Next we apply the total energy constraint (which must hold 
with equality, since increasing £i for any i will only increase 
the objective): 


N 


N 




pj^-pT 

2A 


-1 = 


1 - (1 -p)^ 

2A 


- X 


A = 


1 - (1 -p)^ 

2(i? + X) 


(33) 


Since A < 


p(i-p)* 


A > 


pO-pT . 


for all i = 1,..., X, we must have 


1 - (1 -p)^ ^ p{l-p)^ 


2{B + N) ■ 2 

1 > {l-pf[l+p{B + N)], 


which implies X is the smallest integer satisfying this inequal¬ 
ity. ■ 












